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CLAIMS 




A method of detecting a facial region within 
a video comprising the steras of : 
5 (a) receiving a Virst frame of said video 

comprising a plurality of pixels; 

(b) receiving a subsequent frame of said video 
comprising a plurality of pixels; 

(c) calculating a difference image representative 
10 of the difference between a plurality of said 

pixels of said f irsV frame and a plurality of 
said pixels of said Wbsequent frame ; 

(d) determining a plurality of candidate facial 
regions within said difference image based on 

15 ' a transform of said difference image in a 

spacial domain to a parameter space; and 

(e) fitting said plurality of Vandidate facial 
regions to said dif f erence Vmage to select 
one of said candidate f acial\ regions . 

20 

2 . The method of claim 1 further comprising the 
step of thresholding said difference image thereby removing 
values of said difference image less than a threshold value. 

2 5 3. The method of claim 2 wherein said threshold 

value is a predetermined value and said removing values is 
setting said values of said difference image that are less 
than said threshold value to a selected value. 
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4. The method of claim 1 wherein said transform 
is a Hough transform. 
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5. 



The method of claim 4 wherein said Hough 



transform is 



A(x c/ y c/ r) = A(x c/ y c/ r)+l V x c/ y c/ r 8 (x-x c ) 2 + (y-y c ) 2 =r 2 . 



5 



6. 



The method of claim 1 where said fitting of 



each of said candidate facial regions is based on a 
combination of at least three factors including, a fit 
factor representative of a fit of said candidate facial 
regions to said difference image, a location factor 
10 representative of the location of said candidate facial 

regions within said video, and a size factor representative 
of the size of said candidate facial regions. 



15 step of scaling said first frame and said subsequent frame 

of said video to reduce the number of said pixels of said 

first and subsequent frame prior to said calculating said 
difference frame. 



determining said plurality of said candidate facial regions 
and fitting said plurality of said candidate facial regions 
further comprises the steps of: 



7. 



The method of claim 1 further comprising the 



20 



8 . The method of claim 1 wherein said step of 



25 



(a) determining a set of candidate circles based 
on a Hough transform of said difference 



image ; 
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(b) scoring said set of said candidate circles 
based on a combination of at least three 
factors including, a fit factor 
representative of the fit of said candidate 
circles to said difference image, a location 
factor representative of the location of said 
candidate circles within said video, and a 
size factor representative of the size of 
said candidate circles; 
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(c) selecting at least one of said candidate 
circles based on said scoring; 

(d) generating at least one candidate facial 
region having an elliptical shape for each of 

5 said at least one of said candidate circles; 

and 

(e) scoring each of said candidate facial regions 
based on a combination of at least three 
factors including, a fit factor 

10 representative of the fit of a respective 

said candidate facial region to said 
difference image, a location factor 
representative of the location of said 
respective said candidate facial region 

15 within said video, and a size factor 

representative of the size of said respective 
said candidate facial region. 

9. The method of claim 8 wherein said generating 
20 at least one candidate facial region has a center of said 
elliptical shape located within a bounded region of 
potential locations having a greater vertical dimension than 
a horizontal dimension centered about the center of said 
respective said candidate circle. 
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A method of detecting a facial region within 
a video comprising the steps of 

(a) receiving a first "frame of said video 
comprising a plurality of pixels; 
3 0 (b) receiving a subsequent frame of said video 

comprising a plurality^ of pixels; 
(c) calculating a difference frame representative 
of the difference betweeV a plurality of said 
pixels of said first fram^ and a plurality of 
3 5 said pixels of said subsequent frame; 



£3 
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(d) determining a plurality of candidate facial 
regions within said difference frame; and 

(e) fitting said candidate facial regions to said 
^/ dif f erence\image to select one of said 

5 candidate facial regions based on a 

combination of at least two of the following 
three factors Vncluding, a fit factor 
representative XDf the fit of said candidate 
facial regions to said difference image, a 
10 location factor representative of the 

q location of said candidate facial regions 

Jlj within said video, and a size factor 

is. representative of the^ size of said candidate 

I* facial regions. 

= 11. The method of claim 10 where said determining 

't said candidate facial regions is based on a Hough transform 

:£ of said difference image in a spacial domain to a parameter 

space. 

5 20 

12. The method of claim 11 wherein said Hough 
transform is 

A(x c ,y c/ r) = A(x c ,y c/ r)+l V x c ,y c/ r e- (x-x c ) 2 + (y-y c ) 2 =r 2 . 

25 13. The method of claim 10 further comprising the 

step of thresholding said difference image thereby removing 
values of said difference image less than a threshold value. 



14. The method of claim 10 further comprising the 
3 0 step of scaling said first frame and said subsequent frame 
of said video to reduce the number of said pixels of said 
first and subsequent frame prior to said calculating said 
difference frame. 
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^5. A method ^>f determining sensitivity- 
information for a video comprising the steps of: 

(a) receiving a\ first frame of said video; 

(b) receiving aNsubsequent frame of said video; 

(c) determining a\ location of a facial region 
within said video based on said first and 
subsequent frames ; and 

(d) calculating a sensitivity value for each of a 
plurality of locations within said video 
based upon both saNid location of said facial 
region within said Video in relation to said 
plurality of locations and a non- linear model 
of the sensitivity of\ a human visual system. 

16. The method of claim 15 wherein the step of 
said calculating said sensitivity values is further based 
upon calculating an eccentricity versus image location in 
relation to a viewer of said video for said plurality of 
locations within said video. 

17. The method of claim 16 wherein said 
calculating said sensitivity is further based upon a 
sensitivity versus eccentricity non-linear model of said 
human visual system. 

18. The method of (claim 16 wherein said 
eccentricity is derived according to the following, 
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where 6 E is said eccentrVcity, y is a vertical pixel 
position within said vid4o, x is a horizontal position 
within said video, x c represents a horizontal component of a 
center position of an elliptical said facial region, y c 
represents a vertical component of said center position of 
said elliptical said facial Region, x r represents a first 
elliptical radii of said elliptical said facial feature in a 
horizontal direction; y r represents a second elliptical 
radii of said elliptical said facial feature in a vertical 
direction, and V represents a vising distance. 



15 



19. The method of claim 15 wherein said 
sensitivity values are based upon the distance from the 
outer edge of said facial region to said plurality of 
locations within said video. 



20 



20. The methodAof claim 17 wherein said 
sensitivity versus eccentrVcity non-linear model is derived 
according to the following, 



25 



where S is representative of sa^d sensitivity, 
constant, and 9 E is representati- 
sensitivity function. 



>-ECC 



is a 



ke of a non- linear contrast 
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steps of 



(a) 
(b) 
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A method of encoding\a video comprising the 

receiving a frame of skid video consisting of 
a plurality of pixels; 

calculating sensitivity information for a 
plurality of locations wiVhin said video 
calculated based upon the ^sensitivity of a 
human visual system of a viewer observing a 
particular region of said viaeo; and 



15 



\ „ frarce * * that Pr ° V 1 eS 

(O — dt A Sald llv Cuorm apparent o^ality <* 
a substantially unit vieuer 

said plural^ ° £ 10Catl ° " said particular 
* laid Viewer is observing said 

when saia y- 

■ « n f Said video, 
region or 

h d 0 \ cl.i» 21 wherein said encoding 
»• The meth0<J locations is based on a 

*-M^m 22 wherein said en 

«• ^ meth0d J h tXfoUowing^ 
a accordance with tne, 

- ^V,,. 0,, 0/--^\ ase Ration t actor, 

whe re 0 i. ^^Z^A* ^ 
^ q through S N are v locations, 
and S X tn y plurality of lopa 

information for sai \ 

* laim 23* wherein one of sax 
24 . The method of claim ^ based ^ pon a 

v , s a value from 1 to N, se nsitivity 
SkI where k is a va plurality of saia 

• calculation of a pi im age. 
statistical caic i oca tions of saia 

information for one of 

, , m 24 wherein S k is an 

«• Th : m tfv * - sitlvity intormation ' 

«f ^aid plurality ox 
average of saia v 

, P \laim 24 wherein S k is 
26 . The method of ^ tivicy intor.at.on. 

. mum o£ said plurality o{ saNW 

maximum oj. \ 
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27. The method of claim 21 wherein said encoding 
said frame results in the total number of bits produced for 
said frame being substantially equal to a preselected 
number . 

5 

28. The method of claim 27 wherein said frame is 
encoded only once . 

29. The method of claim 27 wherein said encoding 
10 of each of said plurality of locations is based on a 

respective quantization value representative of a base 
quantization factor divided by said sensitivity information 
for a respective one of said plurality of locations. 

15 30. The method tof claim 29 wherein said base 

quantization factor is derived in accordance with the 
following: 
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where A is representative of the\number of pixels in one of 
said plurality of locations, K and C are constants 
associated with said plurality of locations, N is 
representative of the number of said plurality of locations, 
25 B is representative of said total number of bits, the a* 
values are a measure how much texture Vs associated with 
said plurality of locations, and the sA values are 
representative of the respective said sensitivity 
information squared. 

?>2L. A method for encoding multiple blocks m a 
frame of image data, comprising: 

(a) identifying a target bit valueVequal to a 

total number of bits available tor encoding 
3 5 the frame; 
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calculating sensitivity information for each 
one of the plocks based upon the sensitivity 
of a human visual system observing a 
particular region of the image; 
adapting quantization values for each of the 
multiple blocks to provide substantially 
uniform apparent quality of each of the 
blocks in the frame subject to a constraint 
that the total nVimber of bits available for 
encoding the frame is equal to the target bit 
value; and \ 

encoding the blocks with the quantization 
values . \ 

The method of claim Bl wherein the 
quantization values are derived according to the following, 



20 where, Q i is the quantization value for\each block i, N is 
the number of blocks in the frame, B is\the total number of 
bits available for encoding the frame, Avis' a number of 
pixels in each of the multiple blocks, K \and C are constants 
associated with the image blocks, o i is aA empirical 

2 5 standard deviation of pixel values in the block, and Si is a 

weighting incorporating the sensitivity initormation for the 
block. \ 

33. The method of claim 31 including adjusting 

3 0 the quantization values according to a number of image 

blocks remaining to be encoded, a number of bits still 
available for encoding the remaining image blocks, and a 
value that depends on the sensitivity and texture of the 
remaining image blocks . 



(b) 



10 



(c) 



(d) 



15 



32 . 



• 
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34. The method of claim 32 including using a K 
parameter and a C parameter on a block-by-block basis to 
adjust the quantization values for each of the multiple 
blocks, the K parameter modeling correlation statistics of 
5 the pixels in the image blocks and the C parameter modeling 
bits required to code overhead data. 



35. The method of claim 34 including deriving the 
optimum quantization values in either a fixed mode where the 

10 K and C parameters are known in advance or an adaptive mode 
where the K and C parameters are derived according to the K 
and C parameters of previously encoded blocks. 

36. The method of claim 35 wherein the adaptive 
15 mode includes the following steps: 

(a) deriving values for the K and C parameters 
that exactly predict the number of bits B 
used for encoding previous blocks; 

(b) deriving averages for the derived K and C 

2 0 parameters for the previously encoded video 

blocks ; and 

(c) predicting the K and C parameters for a next 
video block by weighting the average K and C 
parameters according to the initial estimates 

25 for the K and C parameters. 



r y : + Al . A method for ^encoding video comprising the 
steps of : 

(a) detecting the location of a facial region of 
3 0 a frame of said video; 

(b) calculating a sensitivity value for each of a 
plurality of locations within said video 
based upon said location of said facial 
region; and 
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(c) encoding said frame in manner that provides a 
substantially uniform apparent quality of 
said plurality of locations to said viewer 
when said viewer is observing said facial 
region of said video. 

38. The method of (claim 37 wherein said 
sensitivity values are calculated based upon said location 
of said facial region, a size ot said facial region, and a 
non-linear model of the sensitivity of a human visual 
system. 



39. The method of claim 37 wherein said detecting 
said location of said facial region of said frame further 
15 comprises the steps of: 

(a) receiving a first frame of said video 
comprising a plurality of pixels; 

(b) receiving a subsequent frame of said video 
comprising a plurality of pixels; 

20 (c) calculating a difference image representative 

of the difference between a plurality of said 
pixels of said first frame and a plurality of 
said pixels of said subsequent frame; 

(d) determining a plurality of candidate regions 
25 within said difference image; and 

(e) fitting said plurality of candidate regions 
to said difference image to select said 
facial region. 



40. The method of claim 39 wherein said 
determining said plurality of candidate regions is based on 
a Hough transform of said difference image in a spacial 
domain to a parameter space. 
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41. The method of claim 39 further comprising the 
step of thresholding said difference image thereby removing 
values of said difference image less than a threshold value. 



5 42. The method of claim 41 wherein said threshold 

value is a predetermined value and said removing values is 
setting said values less than said threshold value to a 
selected value. 

10 43. The method of claim 40 wherein said Hough 

transform is 

A(x c/ y c ,r) = A(x c ,y c ,r)+1 V x c ,y c/ r e (x-x c ) 2 + (y-y c ) 2 =r 2 . 

44. The method of claim 39 where said fitting of 
15 each of said candidate regions is based on a combination of 

at least two of the following three factors including, a fit 
factor representative of a fit of said candidate regions to 
said difference image, a location factor representative of 
the location of said candidate regions within said video, 
20 and a size factor representative of the size of said 
candidate regions. 

45. The method of claim 39 further comprising the 
step of scaling said first frame and said subsequent frame 

2 5 of said video to reduce the number of said pixels of said 
first and subsequent frame prior to said calculating said 
difference frame. 

46. The method of claim 39 wherein said step of 
30 determining said plurality of said candidate regions and 

fitting said plurality of said candidate regions further 
comprises the steps of: 

(a) determining a set of candidate circles based 
on a Hough transform of said difference 
35 image; 



43 

(b) scoring said set of said candidate circles 
based on a combination of at least three 
factors including, a fit factor 
representative of the fit of said candidate 
5 circles to said difference image, a location 

factor representative of the location of said 
candidate circles within said video, and a 
size factor representative of the size of 
said candidate circles; 

10 (c) selecting at least one of said candidate 

circles based on said scoring; 
(d) generating at least one candidate region 

having an elliptical shape for each of said 
at least one of said candidate circles; and 

15 (e) scoring each of said candidate regions based 

on a combination of at least three factors 
including, a fit factor representative of the 
fit of a respective said candidate region to 
said difference image, a location factor 

20 representative of the location of said 

respective said candidate region within said 
video, and a size factor representative of 
the size of said respective said candidate 
region. 

25 

47. The method of claim 46 wherein said 
generating at least one candidate region has a center of 
said elliptical shape located within a bounded region of 
potential locations having a greater vertical dimension than 
30 a horizontal dimension centered about the center of said 
respective said candidate circle. 
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48. The method of claim 38 wherein said wherein 
the step of said calculating said sensitivity values is 
further based upon calculating an eccentricity versus image 
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«f said video for said 
location in relation to a^i-r^f ^. 
plurality o£ locations within said 

„ The method o£ claim 48 wherein said 

49. ine me following, 
eccentricity is derived according to 



10 



■ v is a vertical pixel 

where 6 E is said eccentricity Y I ^ . zontal posit ion 

5 position within said «*o- \ orlzon tal cogent of a 

within said video, x J V, faciai reg ion, y. 

center position of an ell P y posltlo „ of 

represents a "^^^l \ represents a first 
said elliptical said facial reg X ^ feature in a 

:0 elliptical radii of said ellipti ^ 

horizontal direction; y repr \ . n ^ vertical 

radii o£ said ^ > :r:ing * . 
direction, and V repress y 

j ^ Haim 38 wherein said 
. 5 \r s iXel : :: the distance fro. the 

r^rof slid U -ion to said Polity of 

locations within said video. 
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, ^ r-iaim 38 wherein said 
t ^ ell ircen-rnon-linear ™odel is derived 



sensitivi . 

t-r, the following, 
according to tne 
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where S is representative of said sensitivity, k ECC is a 
constant, and 0 E is representative of a non-linear contrast 
sensitivity function. 

52. The methoNi of claim 37 further comprising the 
step of encoding said fr&^n^ manner that provides a 
substantially uniform app^ntNjuality of said plurality of 
locations to said viewer w\e\ said viewer is observing said 
facial region of said video. 

53. The method of claim 37 wherein said encoding 
of each of said plurality of locations is based on a 
respective quantization value representative of a base 
quantization factor divided by said sensitivity information 
for a respective one of said plurality of locations. 

54. The method Ve claim 53 wherein said encoding 
is derived in accordance wito the following: 

Q/S lf Q,S 2 , Q/S 3 , . . ,Q/S N 
where Q is representative of kid base quantization factor, 
and S x through S N are representative of said sensitivity 
information for said plurality ot locations . 

55. The method of claim 54 wherein one of said 
S k , where k is a value from 1 to N, is derived based upon a 
statistical calculation of a plurality of said sensitivity 
information for one of said locations of said image. 

56. The method of claim 55 wherein S k is an 
average of said plurality of said sensitivity information. 

57 . The method of claim 55 wherein S k is a 
maximum of said plurality of said sensitivity information. 



58. The method of claim 52 wherein said encoding 
said frame results in the total number of bits produced for 
said frame being substantially equal to a preselected 
number . 

59. The method of claim 58 wherein said frame is 
encoded only once. 

60. The method of claim 58 wherein said encoding 
of each of said plurality of locations is based on a 
respective quantization value representative of a base 
quantization factor divided by said sensitivity information 
for a respective one of said plurality of locations. 

61. The method o£ claim 60 wherein said base 
quantization factor is derived in accordance with the 
following: \ 



where A is representative of the Yi umber of pixels in one of 
said plurality of locations, K and\ C are constants 
associated with said plurality of Pocations, N is 
representative of the number of saick plurality of locations, 
B is representative of said total number of bits, the o ± 2 
values are a measure how much texture Vs associated with 
said plurality of locations, and the values are 
representative of the respective said sensitivity 
information squared. \ 



